Benford’s Law, Families of Distributions and a Test Basis

نویسندگان

  • John Morrow
  • Swati Dhingra
  • Ching-Yang Lin
  • Mian Zhu
چکیده

Benford's Law is used to test for data irregularities. While novel, there are two weaknesses in the current methodology. First, test values used in practice are too conservative and the test values of this paper are more powerful and hold for fairly small samples. Second, testing requires Benford's Law to hold, which it often does not. I present a simple method to transform distributions to satisfy the Law with arbitrary precision and induce scale invariance, freeing tests from the choice of units. I additionally derive a rate of convergence to Benford's Law. Finally, the results are applied to common distributions.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Survival Distributions Satisfying Benford’s Law

Hill stated that “An interesting open problem is to determine which common distributions (or mixtures thereof) satisfy Benford’s law . . .”. This article quantifies compliance with Benford’s law for several popular survival distributions. The traditional analysis of Benford’s law considers its applicability to datasets. This article switches the emphasis to probability distributions that obey B...

متن کامل

Stigler’s approach to recovering the distribution of first significant digits in natural data sets

Benford’s Law can be seen as one of the many first significant digit (FSD) distributions in a family of monotonically decreasing distributions. We examine the interrelationship between Benford and other monotonically decreasing distributions such as those arising from Stigler, Zipf, and the power laws. We examine the theoretical basis of the Stigler distribution and extend his reasoning by inco...

متن کامل

Application of Benford’s Law in Analyzing Geotechnical Data

Benford’s law predicts the frequency of the first digit of numbers met in a wide range of naturally occurring phenomena. In data sets, following Benford’s law, numbers are started with a small leading digit more often than those with a large leading digit. This law can be used as a tool for detecting fraud and abnormally in the number sets and any fabricated number sets. This can be used as an ...

متن کامل

Benford’s Law: An Empirical Investigation and a Novel Explanation

This report describes an investigation into Benford’s Law for the distribution of leading digits in real data sets. A large number of such data sets have been examined and it was found that only a small fraction of them conform to the law. Three classes of mathematical model of processes that might account for such a leading digit distribution have also been investigated. We found that based on...

متن کامل

Evaluation of Large-scale Data to Detect Irregularity in Payment for Medical Services

Background: Sophisticated anti-fraud systems for the healthcare sector have been built based on several statistical methods. Although existing methods have been developed to detect fraud in the healthcare sector, these algorithms consume considerable time and cost, and lack a theoretical basis to handle large-scale data. Objectives: Based on mathematical theory, this study proposes a new approa...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010